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ABSTRACT 

We present a simple model for the shape of the distribution function of galaxy peculiar 
velocities. We show how both nonlinear and linear theory terms combine to produce a 
distribution which has an approximately Gaussian core with exponential wings. The 
model is easily extended to study how the statistic depends on the type of particle 
used to trace the velocity field (dark matter particles, dark matter haloes, galaxies), 
and on the density of the environment in which the test particles are. Comparisons 
with simulations suggest that our model is accurate. We also show that the evolution 
of the peculiar velocities depends on the local, rather than the global density. Since 
clusters populate denser regions on average, using cluster velocities with the linear 
theory scaling may lead to an overestimate of the global value of fio • Conversely, using 
linear theory with the global value of fig to scale cluster velocities from the initial 
to the present time results in an underestimate of their true velocities. In general, 
however, the directions of motions of haloes are rather well described by linear theory. 
Our results help to simplify models of redshift-space distortions considerably. 

Key words: galaxies: clustering - cosmology: theory - dark matter. 



1 INTRODUCTION 



The gravitational evolution of the density field in an expand- 
ing Universe modifies the shape of the distribution function 
f{v) dv of the peculiar velocity field. Because of the action of 
gravity, the dark matter distribution at the present time is 
certainly not an ideal gas, so it should come as no surprise 
that the present-day distribution of velocities is quite dif- 
ferent from the Maxwell-Boltzmann. Although it has been 
known for some time that the variance of this distribution 
is sensitive to the background cosmology, the first detailed 
model of how and why the shape of this distribution difi'ers 
from a Maxwell-Boltzmann was discussed by Saslaw et al. 
(1990). 

Here we present a model for the shape of the peculiar 
velocity distribution f(v) dv which captures the essence of 
the evolution driven by gravitational instability. Our model 
explicitly includes the type of particle tracing the velocity 
field: dark matter particles, galaxies, or galaxy clusters. This 
feature of the model is crucial because galaxies and clusters 
trace the underlying distribution of dark matter differently 
and measurements of f{v)dv, in which either galaxies or 
galaxy clusters are used as trace particles (e.g., the Mark III 
sample of Willick et al. 1997, the SFI sample of GiovaneUi 
et al. 1998 and Haynes et al. 1999a,b, and the ENEAR sam- 
ple of da Costa et al. 2000) are widely used to constrain 
cosmological models. 



Section M describes our model. In essence, we write the 
motion of a dark matter particle as the sum of two terms, 
one of which evolves according to linear the ory, and another 
which is inherently nonlinear. Section |2.l| shows that this 
split provides a good approximation to what happens in N- 
body simulations, and Section 2.2 shows that our model pro- 
vides a good de scrip tion of the shape of f{v) for the dark 



matter. Section 



2.3 



describes how the model can be used 
to study the depend ence of the shape of f{v) on the local 
density, and Section 2.6 shows how and why the distribu- 
tion of halo velocities is differen t fr om that of dark matter 
particles. This leads, in Section 2.6, to a discussion of how 



the model can be extended to provide predictions for how 
f{v) depends on galaxy type. 



In addition to studying velocities at the present time, 
we also study how velocities evolve from some early time 
to the present. Section argues that the evolution of halo 
speeds should show some dependence on local density, and 
presents results from simulations showing that this actually 
does happen. We also study how the direction of motion of 
trace particles evolves. We show that linear theory describes 
the directions in which dark matter particles are moving 
today rather badly, but the directions of halo motions very 
well, and discuss some useful consequences of this fact. We 
summarize our findings in Section 
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2 THE DISTRIBUTION FUNCTION OF 
VELOCITIES 

The key assumptions of this model are similar to those dis- 
cussed by Sheth (1996) and Diaferio & Geller (1996) in their 
discussions of the distribution of pairwise velocities. Indeed, 
the model described below is essentially the same as that 
discussed in the second half of Sheth (1996), where a de- 
tailed discussion of the significant differences between our 
approach and that of Saslaw et al. (1990) can also be found. 
The final subsection discusses how the model can be ex- 
tended to describe what happens when dark matter haloes 
or galaxies, rather than dark matter particles, are used to 
construct the statistic. 



2.1 Assumptions 

All dark matter particles are assumed to be in approximately 
spherical, virialized haloes. The velocity of a dark matter 
particle is assumed to depend on two variables: the mass of 
the halo in which it is, and the local density in which the 
parent halo is. Although the local density is, in principle, 
a function of smoothing scale, we show below that if this 
scale is chosen so that it typically contains many halos, then 
our results are relatively insensitive to the exact choice. Let 
p{v\m, 5) dv denote the probability that a particle in a halo 
of mass m which is in a region within which the average 
density is (l + S) times the background density, has velocity 
in the range dv about v. Then the distribution of interest 
is given by summing up the various p(v\m,5) distributions, 
weighting by the fraction of particles which are in haloes of 
mass m in regions of overdensity 5: 



/w 



J d5p{S) J dm m n{Tn\S) p{v\m, 5) 
J d5p{5) J dmmn{m\6) 



(1) 



where n{m\S) dm is the number density of haloes that have 
mass in the range dm. about m and are in regions within 
which the average density is 5, and p{S) dS is the fraction 
of regions which have density in the range dS about S. The 
weighting by m reflects the fact that the number of dark 
matter particles in a halo is supposed to be proportional to 
the halo mass. This expression holds both for the size of the 
velocity vector itself, which we will often call the speed, as 
well as for the individual velocity components. 

If we rearrange the order of the integrals in the de- 
nominator, and then integrate over 5, then what remains is 
J dm, m n{m) , where n{m.) is the average density of haloes of 
mass m averaged over all environments, so it is often called 
the universal halo mass function. Because all particles are 
assumed to be in haloes, this universal mass function is nor- 
malized so that J dm, m n{m,) equals the background density 

P- 

To proceed, we need estimates of n{m\5) and of 
p{v\m,S). In what follows, we will study the case of clus- 
tering from Gaussian initial conditions in detail. If the ini- 
tial conditions are non-Gaussian, n{m,\S) and p{v\m,S) are 
of course different from what we use, but our equation (fil) 
is still valid. 

For hierarchical clustering from Gaussian initial condi- 
tions, a good a pproximation to the universal number den- 
sity n{m,) of haloes of mass m is given, for arbitrary power- 
spectra, cosmology and times, by Press & Schechter (1974). 



In what follows we will use the simple modification to this 
formula provided by Sheth & Tormen (1999) which is con- 
siderably more accurate: 



n(m) = -— 




exp 



dlnm. 



(2) 



where p is the average background density, A = 0.322, 
a = 0.707, p = 0.3 and u — Sc/(j{m,) with 5^ ~ 1.686 and 
a^{m) represents the variance in the initial density field, 
when smoothed on the scale R = (3771/471^)^'"^, extrapolated 
to the present time using linear theory. The Press-Schechter 
formula has A = 0.5, a = 1 and p = G. Sheth, Mo & Tor- 
men (2000) argue that whereas the Press-Schechter formula 
is associated with models in which haloes form from a spher- 
ical collapse, this more accurate formula is associated with 
ellipsoidal collapse. 

What we really need is n(m\5). Unfortunately, the 
simplest approximation to this distribution is not accu- 
rate (Lemson & Kauffmann 1999; Sheth & Lemson 1999): 
n{m\5) 7^ (1 -I- 5)n{m). However, a good approximation to 
the actual distribution can be computed following the work 
of Mo & White (1996): 



n{m\S) 



1 -I- b{m)5 n(m 



(3) 



where h{m) depends on the halo mass (Mo & White 1996) 
and on the shape of n(m) (Sheth & Tormen 1999). This 
approximation is accurate provided that 5 is defined by 
smoothing the density field on scales that are sufficiently 
large that a randomly placed cell contains many haloes. 
Equations (10) and (11) in Sheth & Tormen (1999) give 
the 6(m) relation associated with the 71(771) distribution we 
use in this paper (our equation bl). For massive haloes, 6(771) 
increases with m, so the ratio of the number of massive to 
less massive haloes is larger in dense regions than in under- 
dense regions. The exact form of the predicted dependence 
on 5 is slightly more complicated than the simple approxi- 
mation shown above, and is in reasonably good agreement 
with simulations (Sheth & Tormen 2000). When computing 
the model predictions which follow, we actually use the exact 
formula for n{m\5), rather than the simpler approximation 
shown above. 

We now turn to the other term, p{v\ra,5). To model 
p{v\m, 5) we will assume that the velocity of any given dark 
matter particle is the sum of two terms, 



■ Wvir + "halo 



(4) 



the first is due to the velocity of the particle about the centre 
of mass of its parent halo, and the second is due to the 
motion of the centre of mass of the parent. We will assume 
that each of these terms has a dispersion which depends on 
both halo mass and on the local environment, so that 



O- (771,(5) = Cr^,i,(7n,(5) +0-halo(w7,(5). 



(5) 



The assumption that the virial motions within a halo are 
independent of the halo's environment, is probably reason- 
ably accurate. It is not clear that the same is true for the 
halo speeds. Indeed, in the next section we will show that 
haloes in dense regions move faster than those in under- 
dense regions. It will turn out, however, that the fraction 
of regions in which cr^aio('Ti, 5) is significantly different from 
''"haio('Ti, 0) is quite small. This means that neglecting the 
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density dependence of the second term should be a reason- 
able approximation. 

In what follows, we will assume that the dispersion de- 
pends on the mass of the parent halo, but not the local 
density: a'^{m,5) = a^{m). In fact, we will make the even 
stronger assumption that what is true of the second moment 
is also true of the distribution itself: p{v\m, S) = p{v\m). In 
this case, we can rearrange the order of the integrals in the 
numerator of equation (nl), and then integrate over S to get 



/M = 



n{m) 
dmm — —^p{v\m) 



(6) 



where we have used the fact that the integral over 5 gives 
the universal mass function, and the integral of m times the 
universal mass function over all m gives the average density. 
To proceed, we need a model for the actual shape oi p{v\m). 
Since v is the sum of two random variates, we study each in 
turn. 

Consider the first term, Uvir. We will assume that virial- 
ized haloes are isothermal spheres, so that the distribution 
of velocities within them is Maxwellian. This is in reason- 
able agreement with simulations. Accounting for the fact 
that haloes really have more complicated density and veloc- 
ity profiles is a detail which complicates the analysis, but 
not the logic of our argument. The main reason for this is 
that in the isothermal sphere model, the velocity dispersion 
of particles within the halo is independent of where they are 
within the halo. 

What is the dispersion of this Maxwellian? At any given 
time, the average density of a virialized halo is approxi- 
mately independent of the halo mass. This means that the 
mass m and size r of a halo are related: 7n/(47rr''/3) is 
a constant, typically equal to about 200 times the critical 
density at that time. This, with the virial requirement that 
m/r oc CTvin implies that the dispersion of the Maxwellian 
depends on halo mass: a^iy.{m,a) oc rr?'^ . The constant of 
proportionality depends on time a and cosmology, and on 
the exact shape of the density profile of the halo. We will 
use the relations provided by Bryan & Norman (1998) to set 
this constant: 

1/3 



lOiSAfe/Zi 



v^ <') 



where g^ = 0.9, Ani = ISvr^ -I- 60a; - 32x^ with x = n{a) - 1, 
n{a) = {no/a^)/E^{a), E'^{a) = D,o/a^ + ^r/o^ + Q,A,a^ 
1/(1 + z), where z is the redshift, so a = 1 at the present, 
O.o{a) is the ratio of the energy density in matter to the 
critical density, and Q.h denotes the corresponding ratio for 
the energy density associated with a cosmological constant. 
Fig. hq below shows that we are safe in assuming that this 
virial term is independent of local environment. 

We turn, therefore, to the second term, Hhaio- It will 
prove more convenient to first study halo speeds after aver- 
aging over all environments, before considering the speeds as 
a function of local density. (This parallels the order in which 
we presented the halo mass function and its dependence on 
density.) Consider a halo of size r at the present time. Be- 
cause the initial density fluctuations were small, the particles 
in this halo must have been drained from a larger region R 
in the initial conditions: R/r ~ A^^j , where Ani ~ 200 or 
so (see text following equation H). This means, for exam- 
ple, that massive haloes were assembled from larger regions 



than less massive haloes. Suppose we compute the rms value 
of the initial velocities of all the particles which make up a 
given halo. If we do this for all the haloes of mass m, then 
this is similar to computing the rms velocity in linear theory, 
smoothed on the scale R{m) oc m^". 

It is well known that the linear theory prediction for 
the evolution of velocities is more accurate than the linear 
theory prediction for the evolution of the density. Although 
linear theory provides a useful estimate of the initial rms 
velocities of patches in the initial conditions that become 
haloes at z = 0, Colberg et al. (2000) showed that it is more 
accurate to assume that the massive haloes are associated 
with peaks in the initial fluctuation field. Bardeen et al. 
(1986) showed that the linear theory rms velocity of regions 
which are peaks when smoothed on the scale R{m) is slightly 
smaller than that of average regions of the same scale, and 
Colberg et al. showed that the peak estimate agrees with 
the initial rms velocities of the haloes in their simulations 
to within about 20% or so. Colberg et al. only studied the 
most massive objects in their simulations; in what follows, 
we will show that the linear theory rms velocities of peaks 
provides an accurate estimate of the initial rms velocities of 
haloes of all masses. 

Since we are interested in the rms velocities of haloes 
at the present time (i.e., not in the initial conditions), we 
must know how these velocities evolve. Colberg et al. (2000) 
showed that scaling the rms velocities of peaks using the lin- 
ear theory provided an underestimate of the actual growth 
of the rms velocities of the massive clusters in their simula- 
tions. They showed that this procedure provides velocities 
which are accurate to within 40% or so at z = 0. While 
this is good enough for the results presented in the first part 
of this section (we will discuss why shortly), many of the 
results to follow are sensitive to this difference. Therefore, 
Section a studies the reason for this underestimate. 

To summarize: in what follows, we will assume that at 
the present time, the velocities of all haloes, not just the 
massive ones, are reasonably well described by extrapolating 
the velocities of peaks (smoothed on the relevant scale: R oc 
m^'^) using linear theory. For Gaussian initial conditions, 
this means that any given value of Uhaio is drawn from a 
Maxwellian with dispersion a^^i^{m) given by: 



o"haio(m) = Hoflo'^ (J-1 \J\ - (jfjala'i^, 



(8) 



where Hq and fio are the Hubble constant and the density 
parameter at the present time. 



-.M-^ 



dfc fc^+^^P(fc) W^[fci?(m)], 



and W{x) is the Fourier transform of the smoothing win- 
dow. For the TopHat in real space that we will use in the 
remainder of this paper, W(x) — (3/a;^) [sin(a;) — x cos(a::)]. 
Notice that the predicted rms velocity depends both on cos- 
mology and on the shape of the power spectrum. The term 
under the square-root arises from the peak constraint — it 
tends to unity as m decreases: the peak constraint becomes 
irrelevant for the less massive (small R) objects. 
We have found that 



o-haio(m) 



0"fit 



1 + {R/R&tY 



(9) 
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Figure 1. Dependence on halo mass of the nonlinear (a"vir) and 
linear theory (chaio) terms in our model. Solid curves show the 
scaling we assume, and symbols show the corresponding quanti- 
ties measured in the z = output time of the SCDM GIF simu- 
lation. Error bars show the 90 percentile ranges in mass and ve- 
locity. Dashed curve in panel on right shows the expected scaling 
after accounting for the finite size of the simulation box. Sym- 
bols and curves in the bottom of the panel on the right show the 
predicted and actual velocities at 2; = 20. 



provides a good fit to equation (fel) in the CDM family of 
models; Table 1 gives the values of am, Rm and rj for the 
three representative models we consider in this paper. With 
these values, the fit is better than about 2% for SCDM, and 
about 1% for ACDM, over the range 0.05 < R <50 Mpc/h. 

Fig. hi compares the dependence on mass we assume 
for our two terms with that measured in numerical simu- 
lations of clustering in an SCDM universe (see Kauffmann 
et al. 1999 for details of the GIF collaboration simulations) . 
The symbols with error bars show the median and ninety 
percentile ranges in mass and velocity. Open squares, filled 
squares, open circles and filled circles show haloes which 
have 60-100, 100-10^, 10^-10"* and lO'^-lO'' particles, respec- 
tively. There are two sets of symbols in the panel on the 
right. For the time being, we are only interested in the sym- 
bols in the upper half which show halo velocities at z — 0. 
The solid curves in the two panels show the scalings we as- 
sume. 

Although the scaling of the virial term with mass is 
quite accurate, it appears that the extrapolated linear the- 
ory velocities are considerably in excess of the measurements 
in the simulations. This is almost entirely due to the finite 
size of the simulation box. The upper dashed curve shows 
the effect of using equation (^ to estimate the rms speeds 
of halos, after setting P{k) — for k < 2tt/L, where L is 
the box-size: L = 85 Mpc/h for SCDM and it is 141 Mpc/h 
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Figure 2. As for the previous Figure, but for ACDM. Since the 
simulation box is larger, finite size effects are less dramatic. 



for ACDM. Thus, the two panels show that our simple esti- 
mates of the two contributions to the variance of p(«|77i) are 
reasonably accurate. 

Fig. y shows a similar comparison between our model 
scalings and the GIF simulation of a ACDM cosmology. In 
this case, the simulation box is larger, so the finite box- 
size effects on the halo velocities are much less dramatic. 
Although the accuracy of the scaling of the virial term with 
mass is well known, the reasonably good agreement of the 
extrapolated linear theory peak velocities, over the entire 
mass range present in the simulations, is new. 

Notice that the two terms scale differently with halo 
mass; indeed, to a first approximation, one might even ar- 
gue that halo speeds are independent of halo mass. Since our 
model for the average halo speeds is really only accurate to 
within about 40% or so (e.g. Colberg et al. 2000 for the mas- 
sive haloes), it is interesting to consider how much we expect 
our use of linear theory peak velocities, and our neglect of 
the possibility that the speeds may depend on environment, 
to influence the results which follow. In the ACDM model 
at small masses, the linear theory prediction is not affected 
by box size (the solid and dashed curves in the panel on 
the right are similar), and it provides a good description of 
the average halo speeds. It is at larger masses where the in- 
accuracy of linear theory may be problematic. Fortunately, 
Fig. y shows that ahaioim) < avirim) for massive haloes. 
Since massive haloes have larger dispersions than less mas- 
sive haloes, the large velocity tail of f{v) is determined pri- 
marily by the nonlinear virial motions within massive haloes, 
rather than by the peculiar motions of the halo centres of 
mass. For this reason, the large velocity tails of f{v), at least, 
are unlikely to be sensitive to inaccuracies in our treatment 
of halo velocities, or to our neglect of the possibility that 
halo speeds may depend on their environment. 

We are finally in a position to specify our model for 
p{v\m). Recall that the virial motions are Maxwellian, and 
that, for Gaussian initial density fiuctuations, our linear 
peaks theory model of the halo motions means that they 
too are Maxwellian. Thus, in our model, each of the three 
cartesian components of the velocity of a dark matter parti- 
cle in a clump of mass m is given by the sum of two Gaussian 
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Figure 3. The distribution of one-dimensional peculiar veloci- 
ties for dark matter particles in a SCDM cosmology. Histograms 
show the distribution of the three cartesian components mea- 
sured in the GIF simulations. Dashed and dot-dashed curves show 
Gaussian and exponential distributions which have the same dis- 
persion. The solid curve shows the distribution predicted by our 
model, after accounting for the finite size of the simulation box. 
The exponential wings are almost entirely due to virial motions 
within haloes. 



distributed random variates, one with dispersion a^ir(m)/3 
and the other with 0"^jjiQ(m)/3. If we further assume that the 
motion around the clump centre is independent of the mo- 
tion of the clump as a whole, so these two Gaussian variates 
are independent, then p{v\m) is Maxwellian (equation [13 
gives the exact shape) with a dispersion which is the sum of 
the individual dispersions (the sum in quadrature of equa- 
tions M and H). This model for p{v\m) can be thought of 
as a simple way in which the contributions to the velocity 
distribution statistic are split up into a part that is due to 
nonlinear effects (the first term) and a part which follows 
from extrapolating linear theory to a later time (the second 
term). 

Our model should be reasonably accurate even if the ini- 
tial conditions were non-Gaussian. Of course, non-Gaussian 
initial conditions have different n{m) and n{m\S) distribu- 
tions than the ones we use here, though they can be com- 
puted from the statistics of the density fluctuation field 
similarly to how they were computed in the Gaussian case 
(e.g. Lucchin & Matarrese 1988; Robinson & Baker 1999; 
Sheth 2000). Although the virial motions are likely to still 
be Maxwellian, the fact that the linear peak theory veloci- 
ties studied above were Maxwellians is specific to Gaussian 
initial conditions. In general, p{v\m,S) will likely be differ- 
ent from a Maxwellian, although again, in linear theory, it 
too can be computed from the statistics of the initial density 
field. 



2.2 Dark matter particles 

Having described our model assumptions, we are ready to 
compare our predictions with simulations. In practice, we 
are only likely to observe velocities along the line of sight. 
This means that we will eventually be interested in the dis- 
tribution of f{v) projected along the line of sight. Projection 
changes the Maxwellian p(w|m) distributions into Gaussians: 



Figure 4. As for the previous Figure, but for ACDM. 



p{v\m) 



-[^/<T(m)]2/2 



^27ra2 



(10) 



where a^{m,) is one third of the sum in quadrature of equa- 
tions (W) and iM). Figs. H and W show the one-dimensional 
f{v) distribution for two representatives of the CDM family 
of models presented in Figs, hi and GI The histograms show 
the distribution measured in the GIF simulations. For com- 
parison, the dashed and dot-dashed curves in each panel 
show Gaussians and exponential distributions which have 
the same dispersion. The solid curves show the distribution 
predicted by our simple model (equations pi Gi and hol) . Ex- 
ponential wings, and a small \v\ core that is more Gaussian 
than exponential are a generic prediction of our model. The 
exponential wings are almost entirely due to nonlinear mo- 
tions within massive haloes, so they are fairly insensitive 
to our assumptions about how fast these haloes move. We 
conclude that our model provides a reasonably good descrip- 
tion of what actually happens in the simulations. It is worth 
emphasizing that a{m) in equation (FKj) is set by the cosmo- 
logical model and the initial conditions. Thus, the second 
moments of the distributions in FigsO and W are not free 
parameters of our model. The agreement with simulations 
suggests that our simple treatment of nonlinear and linear 
contributions to the statistic, and our assumption that any 
dependence on local density can be neglected, are sufficiently 
accurate. 



2.3 Dependence on local density 

Kepner, Summers & Strauss (1997) argue that it may be use- 
ful to compute velocity statistics as functions of local den- 
sity, where the local density is defined as the mass contained 
within some larger smoothing scale, divided by the volume of 
the larger scale. Kepner et al. were actually interested in the 
pairwise velocity distribution, rather than the single particle 
distribution function considered here, but, following Sheth 
(1996) and Diaferio & Geller (1996), our discussion below of 
f{v\S) applies equally to the pairwise statistic in which they 
were interested. Our calculation of f{v) is easily extended 
to provide a model of f{v\6). 
If we require that 
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Figure 5. As for the previous Figure, but using only tlie particles 
which were in the densest regions of a ACDM model. The density 
was defined by smoothing the density (but not the velocities) with 
a Gaussian filter of size _Rg = lOMpc/h. Figure shows results for 
the densest 5% of the particles. 



/(«) 



/(^|<5)p(5)d5 



(11) 



where piS) is the fraction of cells which have overdensity 
5, and we have not written explicitly that both terms in 
the integrand depend on the smoothing scale used to define 
the local density, then equation (nl) shows that the density- 
dependent statistic of interest in this section is 



j\v\b)= p(v\m,d) am. 



(12) 



In the context of our model, f{v\S) may depend on local 
density either because the distribution of halo masses de- 
pends on local density or because the distribution of halo 
speeds depends on the local density, or both. 

We have already discussed how the number density of 
haloes of mass m depends on S (equation m) , We now turn to 
the other term, p{v\m,S). Recall that the assumption that 
the virial term is independent of S is accurate. If halo speeds 
are also independent of their environment, then p{v\m,5) 
will be the same sum of two Gaussians that we used in 
the previous section: p{v\m,5) — p{v\m). In other words, 
if halo speeds are independent of their surroundings, then 
f(v\S) will difi'er from the global /(«) simply because mas- 
sive haloes, which have higher virial motions, occur predom- 
inantly in dense regions. Similarly, because less dense cells 
have fewer massive halos than average, the virial contribu- 
tion in underdense cells is slightly lower than average. So, 
the rms velocity in less dense cells should be smaller than in 
denser cells. 

Section H shows that, in fact, haloes in denser regions 
move faster than haloes in less dense regions. Although the 
shape of the distribution of the halo speeds remains approx- 
imately Gaussian, the dispersion depends on density. We 
incorporate this into our model by setting 



p{v\m,5) 



where 



-l3v^/cr^{m,S)]/2 



y/2TTa^{m,S)/3 



(13) 
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Figure 6. As for the previous Figure, but for underdense regions; 
the least dense 5% of the particles. 
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and where the two cts are given in equations (m) and (W . As 
discussed in the next section, jj, depends on the cell size on 
which 5 was defined {fi = 0.6 for the results which follow). 
In the previous subsection, we were able to neglect this de- 
pendence on S because the fraction of regions in which the 
dispersion was significantly different from the average value 
was small. Here, because we are studying the distribution as 
a function of 5, we cannot ignore this dependence. 

The histograms in Figs. M and H show the f{v\S) distri- 
bution for the densest and least dense regions in the ACDM 
simulations. They were constructed by computing the den- 
sity field on a 256^ grid with the cloud-in-cell technique, and 
smoothing the density field using a Gaussian filter of scale 
Rg ~ 10 Mpc/h. Each particle was then assigned a density 
equal to that of the nearest grid point. The figures show the 
distribution of speeds for the densest and least dense 5% of 
the particles. The average overdensity of these particles is 
given in the top right hand corner of the two panels. 

To compute the associated theory curves we must in- 
tegrate over the probability p{S) that a cell has overdensity 
5. Since we simply wish to illustrate that our model can 
account for the striking differences between the two plots, 
we will show the result of simply using the value of the in- 
tegrand at the mean value of the overdensity, rather than 
doing the integral exactly. To compute this representative 
overdensity, we must account for the fact that the theory 
requires the value of the density in a tophat filter whereas 
the simulations were smoothed with a Gaussian filter. This 
has two consequences. First, if we require the two filters to 
contain the same mass, then the scale associated with the 
tophat is about Rth ~ 1-6 -Rg. In addition, we must account 
for the fact that the mean value of the overdensity in such a 
tophat is different from that in the Gaussian. If the distribu- 
tion of the smoothed Sq and (5th fields were Gaussian, then 
we could compute the average {5th, Rth\Sg , Rg) ■ This ex- 
ercise shows that the mean value of the tophat fiuctuation is 
typically larger (smaller) than that of the Gaussian in over 
(under) dense regions. This suggests that to compare with 
theory, we should use -Rth = 16 Mpc/h, and the associated 
value of the overdensity. Using (1 -f- <5th) = 3.5, and 0.4 for 
the two cases provides rms values which are similar to those 
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of the simulation. These values in our model give the solid 
curves in the two panels. The figures show that our model 
describes the differences between the two regimes quite well. 
Including the fact that halo speeds depend on their en- 
vironment (haloes in dense regions move faster than they 
do in underdense regions) was essential to reproduce the 
simulation results: whereas the dependence on density of 
the halo mass function (equation 0) controls the tails of 
the distribution, it is the dependence of halo speeds on 
density (equation |l3| ) which controls the width of the cen- 
tral core. Neglecting this dependence produces distributions 
which have narrower/broader cores than those measured in 
the dense/underdense regions. Section M discusses why halo 
speeds depend on local density. 

2.4 Dark matter haloes 

Our model can also be used to estimate what happens to 
the shape of f{v) if something other than dark matter par- 
ticles are used as trace particles. For example, suppose we 
wish to construct the distribution of velocities if only haloes 
within some mass range are used. This was done by Croft & 
Efstathiou (1994). They found essentially no dependence of 
CThaio on cluster mass for the massive clusters in their simula- 
tions. They also reported that, in the tails, the distribution 
of massive cluster velocities differed from a Maxwellian. Our 
model provides a simple explanation for their findings. 

In our model, the distribution of halo speeds is given by 
equation (mj, but now each halo contributes only once, so the 
weighting by mass should be removed (both from the numer- 
ator and the denominator). Also there should be no contri- 
bution from the Vvir term: so p{v\m,S) is a Maxwellian (a 
Gaussian in one-dimension) with dispersion equal to the sec- 
ond term, cr^aio('^i ^)) in equation (M). If the range in halo 
masses and environments considered is sufficiently small, 
then f{v) will be approximately Gaussian; departures from 
the Gaussian shape will become increasingly apparent as 
the range in m and 5 is increased. Since ffhaio depends only 
weakly on halo mass (Figs, hi and Q; note that this is con- 
sistent with Croft & Efstathiou) any departures from Gaus- 
sianity are almost entirely due to the dependence of halo 
speeds on S. 

As we show in the next section, the halo speeds do de- 
pend on S. We were able to neglect this dependence when 
studying the speeds of dark matter particles, because the 
large velocity non-Gaussian tails were determined almost 
entirely by the virial motions of the particles, and not by 
the halo speeds. Since these virial motions do not contribute 
to the motions of the haloes themselves, they do not mask 
the effect of the density dependence of halo speeds on the 
distribution of Whaio- Thus, in our model, both for the dark 
matter, and for the haloes, the distribution of speeds is pre- 
dicted to have a non-Gaussian tail which is a consequence 
of adding up Gaussian distributions having different disper- 
sions. However, whereas for the dark matter, this tail arises 
more from the mass dependence of the virial motions within 
dark haloes than from any dependence on environment, the 
tail in the distribution of halo speeds arises more from the 
fact that the halo speeds depend on environment than on 
halo mass. 

To illustrate this, panels on the left of Fig. vA show the 
distribution of halo speeds we obtained after splitting the 
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Figure 7. Dependence of the distribution of halo speeds on 
environment. Top left panel shows the distribution of three- 
dimensional speeds for 50% of the haloes, chosen because the lo- 
cal density around them is high, and bottom left panel shows the 
distribution for the other half. Top right panel shows the distribu- 
tion when only 5% of the haloes, having the highest local density, 
are used. Bottom right panel shows the distribution of speeds 
for the total halo sample; it is quite different from a Maxwellian 
(dashed curve). Dot-dashed curve shows the corresponding Exp- 
wellian distribution associated with exponential one- dimensional 
distributions. Solid curves show what our model predicts. 



total halo population into two samples, each containing half 
of the total sample, based on the value of the local density 
around each halo. The top left panel shows the distribution 
for the haloes in the dense regions, and the bottom left panel 
shows the distribution for the haloes around which the local 
density is smaller. The local density of a halo was computed 
by smoothing the dark matter field onto a 256"^ grid using a 
Gaussian filter of scale Rg = 10 Mpc/h, as described in the 
previous subsection. The local density of a halo was then set 
equal to the density of the nearest grid point. 

For comparison, we also show two other distributions: a 
Maxwellian, and an 'Expwellian'. Whereas a Maxwellian has 
independent Gaussian variates for each of the three cartesian 
components of the velocity, the Expwellian has components 
which are exponentially distributed. The Maxwellian has the 
form 



/mb(«. 



'2v 



d_ 

dv 



'{v/^r/2 



(15) 



\/27ra2 

where a is the rms of each of the components, whereas the 
Expwellian is 



/Exp(f,(T) 



-2w 



d_ 

dv 



-V2(v/a) 



(16) 



This way of writing the three-dimensional distributions fol- 
lows from the assumption that the vector is drawn in a ran- 
dom direction and its length is independent of its direction; 
this in turn implies that its components have identical one- 
dimensional distributions (Feller 1966). 

It would appear that the true distribution (histograms) 
is Maxwellian (dashed curves) in the less dense regions, but 
more 'Expwellian' (dot-dashed curves) if the local density is 
higher. This is consistent with our model (solid curves), be- 
cause the range of 5 is much smaller in the underdense than 
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the overdense regions. To illustrate that the non-Gaussian 
tail really does arise from mixing Maxwellian distributions 
with different dispersions, and not from velocities which are 
intrinsically non-Gaussian, the top right panel shows the 
distribution of speeds computed using haloes in the denser 
regions, but now with the cut at 5% rather than 50% of the 
total sample. In this case, even though the average value of 
the local density is considerably larger, the range of over- 
densities is smaller, and so our model predicts that the dis- 
tribution should be well fit by a Maxwellian (with a larger 
dispersion, of course). The figure shows that this prediction 
is correct: even in these densest regions, where the haloes 
are moving significantly faster than in the rest of the simu- 
lation box, the components of the velocity are approximately 
Gaussian distributed. 

The figure shows results for the full three-dimensional 
distributions; we did this because the one-dimensional dis- 
tributions look quite Gaussian even when the range in 5 
is large. At first, this may seem unexpected: one usually 
thinks of the three-dimensional distribution as a sum of in- 
dependent one-dimensional distributions, so if each compo- 
nent is Gaussian, then the full three-dimensional distribu- 
tion should be Maxwellian. The reason why this does not 
happen is that the large v non-Majcwellian tail comes from 
Gaussians which have large dispersions. If the dispersion is 
large, then all three components are likelier to be large, so 
the three-dimensional speed is that much more likely to be 
large. Since the final distribution we are considering here 
is a sum over Gaussians of different dispersions, the dif- 
ference between the three-dimensional distribution and a 
Maxwellian appears to be larger than the difference between 
each of the components and a Gaussian. Another way of 
saying this is that the departures from a Gaussian distri- 
bution are significant when f{v) < 10^'', whereas the de- 
partures from Maxwellian are significant when f{v) is an 
order of magnitude or so larger. In effect, this means that 
it takes fewer haloes to measure this difference reliably if 
one uses the three-dimensional distribution rather than the 
one-dimensional one. An important consequence of this is 
the following. Suppose that only one component of the ve- 
locity was measured and found to be approximately, but 
not exactly, Gaussian. Then the discussion above cautions 
strongly against concluding that the three-dimensional ve- 
locities were drawn from a Gaussian random field. 



2.5 On isotropy and correlated components 

There is an interesting aspect of our model which we have 
not discussed. The Maxwell-Boltzmann distribution associ- 
ated with linear gravitational instability theory and Gaus- 
sian initial density fluctuations has the special property that 
the three components of the velocity are independent Gaus- 
sians, and the resulting velocity fleld is isotropic. If the com- 
ponents are non-Gaussian, then isotropy requires that the 
components be correlated (e.g. Feller 1966). We will show 
this explicitly below for a few toy examples. 

Consider what happens when the velocity is the sum 
of two rather than three components. We will study the 
distribution of the angle between the vector and the x- 
axis in two steps; we will derive the distribution Q{t) of 
r = tanO = Vy/vx first, and we will then use the fact that 
dr/d^ = (1 -I- r^) to compute the distribution of 6 itself. 



And, for simplicity, we will only study the distribution in d 
in the regime where both Vx and Vy are positive. (Of course, 
the distribution in the other quadrants can be derived anal- 
ogously.) If Vx and Vy are independent, but have the same 
distribution p, then 



(17) 



Q(r) = / p{vx)p{vy = TVx)vxdVx- 

lip is Gaussian or exponential, then 

QGauss(r) = {27t)-'{1 + t^)-' and 



QExp(-r) 
so that 



(1/4) (1 + r)- 



qciO) = l/27r and 
gExp(6') = (1/4) (sine -I- cos ( 



(18) 



(19) 



Whereas the distribution of 6 associated with independent 
Gaussians is uniform, the one associated with Exponentials 
is not; it has a minimum at 7r/4. To get a uniform distribu- 
tion in 6, Vx and Vy cannot be independent if they are not 
Gaussian. Had we chosen p distributions which were less 
centrally peaked than a Gaussian (the exponential is more 
centrally peaked) , we would have found q distributions which 
have maxima, rather than mimima, at 7r/4. E.g. a uniform 
distribution on the range < v < V has q{9) oc cos~^ 6 and 
sin"^ 9 ioT < 9 < 7v/4 and 7r/4 < 6* < -7r/2, respectively. 

One consequence of this is that if one wishes to model 
the nonlinear velocity field, and one requires that the non- 
linear field be isotropic, then one's model must have correla- 
tions between the components of the velocity. Building such 
a model is not easy; of course, perturbation theory may be 
helpful here. We were able to circumvent this problem by 
assuming that, for sufficiently small ranges in mass and en- 
vironment, the velocities are, in fact, Gaussian. This allowed 
us to use all the nice features of the Gaussian to build our 
model relatively easily, without worrying about correlations 
between the different velocity components. In our model, 
the total velocity distribution is built up by summing over 
many different Gaussian distributions. Since each of these is 
isotropic, the final field is also. Because the final distribu- 
tion is non-Gaussian, but isotropic, the resulting distribu- 
tion must have correlations between x, y and z components 
of the velocity built in, even though we never explicitly wor- 
ried about how to model them correctly. 

We have measured these correlations in the simulations 
using Spearman's rank correlation p for the three pairwise 
permutations of Vx, Vy, and v^ for the total halo population 
(those in the bottom right panel of Fig. M) . In all three cases 
we found values of about p = 0.075 with significance lev- 
els (which should lie between zero and one) of about 10~'^°. 
These low values indicate that the correlation is extremely 
significant. The same test, with two independent normal 
variates for the components (same sample size of the ha- 
los A'^ — 14089), yields p = 0.0005 and significance level 
0.95, confirming that there is no correlation in this case. We 
then repeated this test using only those halos in the densest, 
and the least dense regions. Recall that our model assumes 
that, for a sufficiently small range in environment, the dis- 
tribution should be Gaussian. In both these cases we find 
that this rank correlation statistic suggests that the velocity 
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components are independent and that therefore their distri- 
bution is, indeed, Gaussian. 



2.6 Galaxies 

Galaxies can also be treated within the context of our model, 
provided we assume that all galaxies form within dark mat- 
ter haloes and that, for the most part, any given galaxy con- 
tains a small enough fraction of the mass of the parent halo 
that it can be thought of as a trace particle within the halo. 
This means that all we now need is a relation which tells 
us how, on average, the number of galaxies scales with halo 
meiss. If iVgai(?n) is known, then we can insert it in place of 
the factor of m in equation (hh. This is clearly a very simple 
assumption, which we will discuss in more detail later. Note 
that this assumption can also be used to model how different 
the spatial distribution of galaxies is from that of the dark 
matter (Seljak 2000; Peacock & Smith 2000; Scoccimarro et 
al. 2000). First, we consider the shape of A'gai(m). 

One might have thought that, on average, the number 
of galaxies in a halo would be proportional to the available 
gas, and that the fraction of gas should be proportional to 
the total mass of the parent halo: A'^gai(m) oc m. However, 
because gas must cool to form stars, and the internal veloc- 
ity dispersions of haloes increase with halo mass (cf. equa- 
tion uh, gas cools less efficiently in more massive haloes. So, 
since the number of galaxies actually depends on the frac- 
tion of gas that can cool, Nga_i{m) oc m'^, with /i < 1, is more 
reasonable. In addition, there is a lower mass cutoff to this 
weighting scheme, which is associated with the fact that if 
the potential well of a halo is not sufficiently deep, the su- 
pernovae formed from the first generation of stars may blow 
the remaining gas out of the halo, thus supressing future 
star formation. 

The fact that Ng^i{m) depends on m has an important 
consequence for our model. If there are many galaxies in a 
halo, then it is reasonable to treat each one as a test particle, 
so that its velocity is similar to that of a dark matter par- 
ticle. However, mass conservation means that a significant 
fraction of low mass haloes may have at most one galaxy 
within them. In such haloes, it makes more sense to iden- 
tify the motion of the galaxy with that of the halo. Even in 
haloes which contain more than one galaxy, there is often 
one dominant central galaxy. It seems sensible to identify 
the motion of this galaxy with that of the parent halo, so 
that for this galaxy also, the virial term does not contribute. 
To model this, we will assume that both the virial and halo 
terms contribute to A^gai — 1 of the galaxies within a halo, but 
that only the halo term contributes for the final remaining 
galaxy. 

Although this A'^gai(?n) weighting scheme for incorpo- 
rating galaxies into our model is extremely simple, we think 
it provides a useful framework for constructing more compli- 
cated, and presumably more realistic, schemes. For example, 
it is known that ellipticals occur preferentially in clusters, 
whereas spirals occur in the field. Also, within a cluster, el- 
lipticals are more strongly concentrated towards the cluster 
centre than are the spirals. The first fact can be incorpo- 
rated into our scheme simply by using two different weight- 
ing schemes: A'bii('ti) and A'^spirai('7i). If ellipticals populate 
more massive haloes, then, since the contribution of massive 
haloes to the peculiar velocity statistic is dominated by the 
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Figure 8. The average number of bright galaxies as a function 
of parent halo mass in the ACDM semi-analytic models of Kauff- 
mann et al. (1999). Top panel shows results for a catalog in which 
no correction for the effects of dust was made. Bottom panel shows 
the galaxy catalog after correcting for extinction. Open circles 
show blue galaxies, open squares show red galaxies, and solid cir- 
cles show the sum of the two populations. Curves show the simple 
fits to these relations discussed in the text. 



virial term which increases with halo mass, one would ex- 
pect the velocity dispersion of ellipticals to be larger than 
spirals. 

The fact that ellipticals are concentrated more strongly 
towards cluster centres than spirals is slightly more com- 
plicated to incorporate. If we continue to assume that the 
galaxies trace the dark matter within the halo, then the as- 
sumption that all haloes are isothermal spheres means that 
we needn't make any change other than to the weighting 
term. That is to say, because the velocity dispersion is inde- 
pendent of position within the isothermal sphere, particles 
that are nearer the centre of the halo move similarly to those 
that are further away. Since, in fact, halo density profiles are 
more complicated than isothermal spheres, the velocity dis- 
persions within the halo are actually functions of position, 
it may be that ellipticals and spirals actually have different 
p{v\m) relations as well. This would, of course, complicate 
the relation between (Jeii and agpirai, though it is likely that, 
in general, cteii > crspirai- 

To illustrate how all this works, we will compare our 
model with results from the semi-analytic galaxy formation 
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models of Kauffmann et al. (1999). These models provide 
the Afgai(m) relations we need for our model. Extinction 
due to dust changes the brightness and colours of the model 
galaxies, so that the A'^gai("i) relation we need depends on 
whether or not this effect has been account for. For this 
reason we have chosen to study two samples, and they are 
presented in the two panels of Fig. H. The top panel shows 
galaxies brighter than My = —17.7 + 5 log ^ in a model in 
which the effects of dust have been neglected, and the bot- 
tom panel shows the A'gai(7Ti) relation for galaxies brighter 
than Mb = —17.7 + 5 log h after accounting for extinction. 
In each panel, the filled circles show all galaxies brighter 
than the magnitude cut. The total population can be split 
into a blue sample (open circles) and a red sample (open 
squares) ; these are the objects which are classified as having 
colours with B — I < 1.7 and B — I > 1.7, respectively. 

In what follows, we will use simple approximations (the 
curves in the Figure) to the semi-analytic results. For the 
galaxies in the top panel, 

iVBiuo(m) = 0.7 if 10" Mq/H <m< Mbiuo 

= 0.7(m/MBiuc)"^ ifm>MBiue 

A^Rcd(m) = (m/MRed)°« m> 10^^ Me/h 

Afgai(m) = iVBiuc(m) + iVRod(m), (20) 

where Msiue = 4 x 10^^ Mq/H, q:b = 0.8, M^ed = 2.5 x 
10^^ Mq/H, and qr = 0.9. Extinction due to dust changes 
the brightness and colours of the model galaxies. So we have 
also studied a sample which contains galaxies brighter than 
Mb = —17.7 4- 5 log/i, after accounting for the effects of 
dust. In this case, we find that 

iVBi..(m) = (m/MBi..)"-+0.5e-^"°^io(Wio"-=)]^ 

iVRed(m) = (m/MRed)"«e-(^>^^°"/'")' 
Afgai(m) = iVBiue(m) + iVRcd(m), (21) 

where Mbiuo = 7 x IO^^A/q/Zi, ob = 0.9, AfRcd = 3 x 
10^^ Mq/H, and qr = 0.75. 

There are two reasons why one might worry that the 
semi-analytic models underestimate the number of bright 
galaxies in low-mass haloes. The first is that the models 
only resolve halos with M > 1.4 x 10^^ Mq/H. Secondly, the 
semi-analytics include a model for dynamical friction which 
may be more efficient than accurate numerical simulations 
suggest (Springel, White, Tormen & Kauffmann 2000). As 
a result, galaxies within a halo may merge with the central 
galaxy faster than they should, and so fewer (or just one) 
bright galaxies survive. On the other hand, in small mass 
haloes there simply isn't enough material available to make 
a massive satellite object in addition to the central galaxy, so 
having a small number of bright galaxies in low-mass haloes 
is certainly sensible. 

Whatever the reason for this low mass cutoff, our main 
purpose here is to show how our model for the dark matter 
f(v) distribution can be extended to describe galaxies. Thus, 
although the low mass cutoff in the Ngaj{m) relation may 
not be in exactly the right place, our main interest is in the 
fact that this cutoff occurs in different places for the blue 
than the red galaxies. This has quite dramatic consequences 
on the predicted f{v) distribution. 

When Kauffmann et al. (1999) identify galaxies in the 
simulations, each halo with at least one galaxy is assigned a 
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Figure 9. Distribution of peculiar velocities of model galaxies 
which, after correcting for the effects of dust, are brighter than 
Mb = —17.7 + 5 log ft,. Histograms show results from the ACDM 
models of Kauffmann et al. (1999). Dashed and dot-dashed curves 
show Gaussian and exponential distributions which have the same 
dispersion. Solid curve shows the distribution predicted by our 
A'gai(m) models (lower panel of previous figure). 



central galaxy. The velocity of this central galaxy is chosen 
to be the same as that of the dark matter particle which 
has the greatest absolute value of the potential energy. In 
the real universe, the central galaxy is plausibly almost at 
rest with respect to the barycentre of its dark matter halo, 
whereas in the simulations the central galaxy moves, on av- 
erage, with a velocity of ~ 80 km/s with respect to the 
barycenter of its dark halo. Therefore, we used the mean 
velocity of the dark particles of the parent halo for the ve- 
locity of the central galaxy. The histograms in Fig. M show 
the resulting distribution of bright galaxy peculiar velocities 
in the extinction-corrected simulations (those in the bottom 
panel of Fig. pi). Results for the simulations in which no 
dust correction was applied are sufficiently similar that we 
decided against showing them here. 

Notice that although /gai(t') has a slightly different 
shape than the dark matter statistic, it still has tails which 
are significantly different from a Gaussian. Dashed and dot- 
dashed curves still show Gaussian and exponential distribu- 
tions which have the same dispersion. The solid curve shows 
what our simple A'^gai (m) model predicts — it provides a good 
description of the simulation results. 

Fig. hd shows how the semi-analytic and our model f{v) 
distributions depend on galaxy type. Notice that the red 
galaxies have larger dispersions than the blue, as expected if 
they occur predominantly in more massive haloes. We have 
chosen to present these results using linear plots because 
peculiar velocity catalogs currently available contain on the 
order of a few thousand galaxies, rather than the 10® or so 
in the simulations. In practice, the errors on each peculiar 
velocity measurement are likely to be on the order of a few 
hundred km/s. These will broaden the curves somewhat, 
making the difference between the red and blue samples less 
dramatic. 
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Figure 10. Distribution of peculiar velocities of model galaxies as a function of colour; colours were assigned after accounting for the 
effects of dust. The panels on the left and right show blue and red galaxies, and the one in the middle shows the sum of the two. 
Line-styles are as in previous figure. 



3 THE EVOLUTION OF HALO VELOCITIES 

This section is motivated by the results of Colberg et al. 
(2000) who reported that linear theory underestimates the 
evolution of the speeds of massive haloes. This underesti- 
mate is more severe when massive halos have near neigh- 
bours. This section argues that, indeed, the evolution of halo 
speeds, and so the accuracy of linear theory, depends not so 
much on halo mass as on local density. 

The previous section showed (Figs, nl and 0) that com- 
bining linear theory with the assumption that haloes at 
z = may be associated with peaks in the initial condi- 
tions provides a reasonably good description of the speeds 
of haloes at z = 0. The lower set of symbols in the panels on 
the right of Figs, nl and show the halo velocities in the sim- 
ulation Sit z — 20. Following Colberg et al (2000), a halo's 
velocity at z; = 20 is computed using the same particles that 
are in it at z = 0, but using the velocities they had at z = 20, 
rather than the velocities they actually have at z = 0. The 
lower dashed curve in the panel on the right was computed 
using equation (H) at z = 20, assuming that there was no 
power in modes with k < 2tt/L. This means that the lower 
dashed curve differs from the upper one by the linear theory 
velocity growth factor (ex ^/a in SCDM); on a log plot the 
two dashed curves have the same shape. (Although there 
is a slight offset — the theory curve should be multiplied by 
about 0.9 if it is to pass through the z = 20 symbols — it 
has the same shape as the symbols. Though they do not re- 
mark on it, a comparison of columns 9 and 17 in Table 2 of 
Colberg et al. 2000 shows the same offset.) 

Although the dashed curves appear to have the same 
shape as the symbols, both at z; = 20 and at z = 0, there is 
a systematic discrepancy: the model appears to underesti- 
mate the amount by which the speeds of massive haloes have 
evolved. This discrepancy is the same as that reported by 
Colberg et al. (2000). At face value, this suggests that the ac- 
curacy of the linear theory evolution of halo speeds depends 
on halo mass. In fact, we will argue that the actual evolution 
depends not so much on halo mass as on local density. It is 
well known that the evolution of clustering within dense re- 



gions is accelerated relative to the average (e.g. Tormen & 
Bertschinger 1996; Cole 1997) — we show that this applies 
to the evolution of halo speeds also. In particular, we argue 
that the evolution of halo speeds is best estimated by using 
an r^off which depends on (some suitably defined) local den- 
sity, rather than the global value f2o- We describe a simple 
model in which massive haloes populate denser regions, so 
that the dependence on density appears as a dependence on 
halo mass. 

After studying the accuracy of the predicted linear the- 
ory amplitude, this section turns to the direction of motion 
predicted by linear theory. It shows that nonlinear kicks to 
a particle's motion may substantially change the particle's 
direction. Although the amplitude of the nonlinear kicks are 
larger for particles which end up in massive haloes, these 
kicks are random in direction, so they can be estimated eas- 
ily. The directions of halo motions, on the other hand, are 
extremely well described by linear theory. That is to say, 
haloes identified at any given time are moving in essentially 
the same direction as predicted by linear theory. This pro- 
vides additional justification for our model in which non- 
linear effects are modelled as arising primarily from virial 
motions within haloes. 



3.1 Massive haloes populate denser regions 

Suppose one assumes that massive haloes form near peaks in 
the initial fiuctuation field, and that they are close to local 
minima in the initial potential field (Kaiser 1984; Bardeen 
et al. 1986). Colberg et al. (2000) show this is true for the 
most massive objects present at z = 0. In such a model, 
less massive haloes fall towards the massive ones, so one 
might expect the speeds of less massive haloes to be higher 
in regions which contain massive haloes. This leads to the 
question; Do massive haloes populate denser regions than 
average? 

Equation (W) for n{m\S) suggests that massive haloes 
occur predominantly in denser cells (recall that for massive 
haloes 6(771) increases with m). Although this is intuitively 
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obvious — one reason a cell is denser is because it contains 
massive objects — the Appendix provides a simple quantita- 
tive model of the average density of cells which are known to 
contain haloes of a given mass m. Therefore, in a model in 
which less massive haloes stream towards the more massive 
ones, one might expect the speeds of less massive haloes to 
be higher in denser regions than in less dense ones. More- 
over, since the evolution of clustering is accelerated in dense 
regions relative to the average, one might expect halo speeds 
to also evolve faster in dense regions than in underdense re- 
gions. 

In such a model, massive haloes populate dense regions, 
so the evolution of their speeds is best estimated by using an 
f2eff which is larger than the global value. Moreover, massive 
haloes which have massive near neighbours are, on average, 
in even denser regions. Since fl^H is even larger for the mas- 
sive haloes which have massive neighbours, one might expect 
the discrepancy with the linear theory evolution to be even 
larger for such haloes than for those which are relatively 
isolated. This is qualitatively consistent with what Colberg 
et al. (2000) found in their simulations: the discrepancy be- 
tween the linear theory predicted velocity at z = and the 
actual velocity of a halo at that time is larger for massive 
haloes which have massive neighbours nearby. 

This argument applies to less massive haloes also, al- 
though in this case things are complicated by the fact that 
less massive haloes occupy regions spanning a wider range of 
densities — their speeds evolve faster or slower than average 
depending on whether or not they are in denser or less dense 
regions. On average, however, less massive haloes occupy re- 
gions of about average density, and so using the global value 
of f2o to estimate the evolution of the population as a whole 
will be reasonably accurate. This is why using the global 
value of Qo to extrapolate from z = 20 to z = works well 
for less massive haloes, but results in an underestimate of 
the present day velocities of massive haloes. 

How does such a model compare with what happens in 
simulations? 



3.2 Comparison with simulations: speed 

Fig. [nJ shows the velocities of haloes at z = as a function 
of the 'local' density at that time. The local density was 
computed by smoothing the density field at z = with a 
Gaussian filter of radius Rg = 4Mpc//i. In the top two and 
bottom left panels, the bins in the x-direction were chosen 
to contain equal numbers of haloes, the error bars show the 
range containing 90 percent of the haloes in the bin, and the 
symbols show the median velocity and density for each bin. 
The three panels show haloes of different mass ranges. In all 
three panels, there is a clear trend for haloes in dense regions 
to move faster. Also, comparison of the three panels shows 
that, on average, massive haloes populate denser cells. The 
bottom right panel shows this trend with halo mass more 
clearly. The open squares, filled squares, open circles and 
filled circles show the median velocities of haloes, and the 
median densities of the cells populated by haloes containing 
60-100, 10^-10^ 10^-10** and 10^-10^ particles, respectively 
(the same symbols and ranges as in Fig. y). The error bars 
show the range in which 90% of the haloes in the given mass 
ranges lie (though we have not done so here, we could have 
estimated these ranges directly from the work of Sheth & 
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Figure 11. Dependence of halo speed at 2 = on local density 
at 2 = for haloes of different masses. Haloes in dense regions 
are moving faster than haloes in underdense regions. On average, 
massive haloes populate denser regions, and move slightly more 
slowly than less massive haloes. This is consistent with a model 
in which massive haloes are associated with minima in the initial 
potential field, and less massive haloes are falling towards these 
minima. 
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Figure 12. Dependence of halo speed at 2 = on local density at 
2 = 20, for haloes of different masses. At the present time haloes 
which were initially in dense regions are moving faster than haloes 
which were initially in underdense regions. This trend is true for 
haloes of all masses. 



Lemson 1999). At z = 0, massive haloes clearly populate 
denser regions, and they clearly move slightly slower than 
less massive haloes. 

We have also constructed such a plot using Rg ~ 
10Mpc//i. On this larger smoothing scale also, the trend 
for higher halo speeds in denser cells at z = remains: 
w oc (1 -f S)" with a > 0. Although a increases with Rg be- 
cause the range in densities is smaller at large Rg, whereas 
the halo velocities do not depend on Rg , most of this differ- 
ence in slope can be accounted for as follows. 

On average, dense cells at z = were also dense initially. 
The spherical collapse model allows us to estimate the initial 
overdensity So given the density 1 + S today (Mo & White 
1996 provide a simple fitting formula for this relation). If 
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Figure 13. Dependence of halo speed at z = 20 on local density 
at z = 20, for haloes of different masses. Except for a weak trend 
for haloes in dense regions to move more slowly (the trend is 
reversed at z = anyway) , there is little correlation between halo 
speed and surrounding density. This suggests that the tendency 
to move faster in denser regions, shown in the previous figure, 
must be due to evolution. 



we were to rescale the x-axes to show the initial So, rather 
than the density today, then this would move the bins with 
5 > to the left, and the bins with 5 < to the right. The 
magnitude of the shift would be larger for large values of 
\S\. As a result, the plot for Rq = lOMpc/ft would remain 
essentially unchanged (this just says that large regions have 
not changed their size very much since the initial conditions, 
and So ~ S for small S) whereas the plot for Rq = 4Mpc//i 
would change significantly. If we were to apply this rescaling, 
then V oc [l + ^o('5)]°', with the nearly the same value of a for 
a range of smoothing scales: 4 < Rq < 10Mpc//i. Of course, 
once the smoothing scale on which S is defined becomes very 
large, the trend with 'local' density vanishes. This scale is 
approximately the same as that on which the mass function 
of haloes approaches the global value: n{m\S) -^ n(m)[l + 
b{m)S], but (S< 1. 

Rather than apply this spherical collapse So{S) rescal- 
ing, we have done the following. We have taken all the par- 
ticles that are in a halo at 2 = 0, found their positions at 
z = 20, and plotted Vha.io{z — 0) versus 1 -I- 5 at 2; = 20. 
Fig. [id shows this when the smoothing scale used to define 
the 'local' density at z = 20 was Rq ~ 4Mpc//i. Notice the 
correlation between higher speeds and local densities. This 
figure should be compared with Fig. [13 in which the local 
densities were computed using Rq = 4Mpc//i at 2 = 20 
but the halo velocities were computed using the velocities 
the particles had at 2 = 20. The difference between the 
two plots is striking. Notice that, for all but the most mas- 
sive haloes, the velocities at 2 = 20 show no dependence 
on the local density at that time. Because a sphere of size 
Rq = 4Mpc/^ in which the overdensity is 5 at 2 = was 
actually larger at 2 = 20 [by a factor of (1 + 5)^ ], it is not 
really fair to compare plots in which the same smoothing 
scale is used for both times. We have found no dependence 
of Vhaio(2^ = 20) on local density at 2 = 20 even when the 
density is computed by smoothing with larger filters. (At 
Rg = 20Mpc//i, there is a weak trend for the halo speeds to 
be smaller in denser cells, even for the less massive haloes. 
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Figure 14. Dependence of the ratio of the halo speeds at 2 = 
and 2 = 20 on local density at z = 0, for haloes of different 
masses. Dotted lines in each panel show the linear theory predic- 
tion; dashed lines, the same in each panel, show equation (|22|). 
The measured ratio is higher in denser regions; velocities in dense 
regions evolve faster than in less dense regions. The dependence 
on density is approximately the same for all haloes — it does not 
depend on halo mass. 



It would be interesting to know if this were real. Though 
we have not done so, it should be possible to do the linear 
theory calculation for the predicted bulk motion of a region 
subject to the condition that the region of interest is em- 
bedded in a larger scale over/underdensity. For the special 
case in which the scales on which the initial velocities and 
densities are smoothed are the same, linear theory predicts 
no dependence of the initial velocities on the initial densi- 
ties, which is approximately what our bottom panel shows.) 
We interpret the fact that halo velocities at 2 = 20 are in- 
dependent of local density at 2 = 20, but halo velocities at 
2 = do depend on local density at 2 = 20 as strong evi- 
dence that the evolution of halo velocities depends on their 
environment. 

To quantify this, Fig.h4 shows the ratio of the median 
halo velocity at 2 = 0, vo to that a.t z — 20, U20, as a function 
of local density at 2 = 0, computed using Rg = 10Mpc//i. 
For the ACDM model studied here, the linear theory pre- 
diction is that fo/i'2a ~ 3.16; it is shown as the horizontal 
dotted curve in the panels. The different symbols in each bin 
show the different mass ranges. This shows that the evolu- 
tion of halo velocities depends more on environment than 
on halo mass. The dashed curves show 



(1 + Sr""' fn 



(22) 



vo 
1^20 

where fn oc Qq'^ is the growth factor predicted by linear 
theory, and fj,{R = 10 Mpc/h) — 0.6. The dashed curves in 
the three panels all show this same scaling — the dependence 
of halo speed on local density is independent of halo mass. 
The trend in the bottom right panel of Fig.h4 only reflects 
the fact that massive halos are in denser regions. 

As discussed earlier, ^i{R) depends on the scale R on 
which the local density field was defined. A reasonable ap- 
proximation to the dependence on R can be got by setting 
At = 0.6 a^{R)/o-^{10 Mpc/h), where a^^R) denotes the vari- 
ance in the present day density field when smoothed on scale 
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Figure 15. The dot product of the Hnear and actual directions of motion for dark matter particles in less massive (left), intermediate 
(middle) and massive (right) halos, as a function of overdensity. Symbols show the median and error bars show the range containing 
90% of the particles in each bin. The nonlinear kicks to a particle's motion are larger in massive halos. Dashed line shows the expected 
median if kicks are randomly distributed. 



R. Alternatively, as discussed earlier, one might have chosen 
to incorporate the dependence on R by using [1 + Sq{5)] in- 
stead. The main point we wish to make is that our simple fit, 
equation d23), strongly suggests that the dependence of the 
evolution of halo speeds on local density can be very simply 
parametrized by replacing the global value of Qo with what 
is, effectively, a local one. 

Because the rms speeds of halos depend on environ- 
ment, one might have wondered if the full distribution 
of halo speeds, rather than just the rms, depends on en- 
vironment. However, as we showed in the previous sec- 
tion (Fig. M), if one studies a sufficiently small range of 
halo masses and environments, then p{vhaio\m,5) is Gaus- 
sian/Maxwellian, to a good approximation. When the range 
in m and 5 is increased, the distribution develops a non- 
Gaussian large v tail; this happens not because the veloc- 
ities themselves are non-Gaussian, but because, when one 
considers a wide range in S, one sees the effect of summing 
up Gaussians which have a wide range of widths. 



3.3 Comparison with simulations: direction 

In addition to a speed, linear theory also predicts a direc- 
tion of motion. This subsection shows that the linear theory 
direction does not provide a good indicator of the direction 
of motion of dark matter particles at 2 = 0. It is, neverthe- 
less, a very good indicator of the direction of halo motions. 
Essentially, this is because the motion of a dark matter par- 
ticle receives substantial nonlinear kicks as it becomes part 
of a virialized halo — the amplitude of the nonlinear kicks 
increases with halo mass. Haloes themselves do not suffer 
substantial kicks to their motion. 

In what follows, we will interpret the difference vo — V20 
as the nonlinear kick to a particle's motion. Fig. WH shows 

cos = -. -; r (23) 

Iwolpaol 

as a function of local density for the dark matter particles 
for a range of halo masses. The figure clearly shows that the 
nonlinear kicks to a particle's motion are larger in massive 
haloes. If these kicks were randomly distributed in direction, 
then the present day direction would be the sum of two 
independent Gaussian vectors having different, but known, 
dispersions, o-ijjjio(7n) and a^i^{m), which can estimated from 



the results of the previous section (Fig. 0). This allows us 
to compute the distribution of cos6'. For random kicks the 
mean value should be 



1 



(cos f ) = 



v/TT^ 



r("*)Milo("l) 



(24) 



(This follows from the fact that if the kicks are random, then 
the initial direction of motion and the direction of the kick 
are, on average, at right angles to each other.) For the range 
of halo masses in the three panels above, this would require 
(cos 9) ~ 0.9, 0.6 and 0.3, which is in reasonable agreement 
with the Figure. 

Fig. hra shows the corresponding distribution for the 
halo motions. Although the nonlinear kicks to a halo's mo- 
tion are larger in denser regions, these kicks are almost al- 
ways smaller than the initial velocity. 

Fig. h/n presents one consequence of the fact that the 
nonlinear kicks to a halo's motion are larger in denser re- 
gions. The figure shows that the deflection Shaio is large both 
at small and large values of i;o/w20- Because large values of 
«o/^'20 are associated with dense regions (Fig. hj), the large 
range in 6'haio at large t;o/i;20 is, perhaps expected. The rea- 
son this happens at small i'o/v20 as well is slightly more 
subtle. There is a hint associated with the fact that the 
magnitude of the scatter is the same in the smallest and 
largest bins, and it is approximately the same as the scatter 
in the largest density bin in Fig. hq. Presumably, the haloes 
with small cos&haio and small vo/i'20 are precisely the ones 
which scatter low in the densest cells in Fig. hj; these are 
the haloes in dense regions which have suffered large kicks 
to their initial velocities, so they are now travelling in quite 
different directions than they were initially. Of course, be- 
cause these are the haloes in the densest bins, they are only 
a small fraction of the set of all haloes. Figs. Ilq and nj] show 
that, for most haloes, the linear theory direction is a good 
indicator of the direction of motion today. 

To summarize: This section makes two points. The first 
is that linear theory provides a good description of the ini- 
tial speeds of haloes, but tends to underestimate the rate 
at which the speeds of massive haloes evolve. Although 
second-order perturbation theory is the best way to study 
why this happens, our results suggest that this happens be- 
cause the evolution of halo speeds is faster in denser re- 
gions. The speeds of massive haloes evolve faster than linear 



© 0000 RAS, MNRAS 000, 000-000 



Peculiar velocities of galaxies and clusters 15 



12969 halos 60-10^ particles -- 1030 halos 10^-10^ particles -- 90 halos 10^-10^ particles 



■■■■■■n 



lfl»«pHTH- 



T 



0.5 1 2 

l + (5(z = 0) 



0.5 1 3 

l+5(z = 0) 



0.5 1 

l + 6(z = 0) 



Figure 16. The dot product of the hnear and actual directions of motion for less massive (left), intermediate (middle), and massive 
(right) halos, as a function of overdensity. Symbols show the median and error bars show the range containing 90% of the haloes in 
each bin. The nonlinear kicks to a halo's motion are small; they are larger in denser regions. Dotted lines (same as the previous figure) 
represent the typical kick received by a dark matter particle. 
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Figure 17. Relation between the direction of a halo's motion and the evolution of its speed. Panels show the dot product of the linear 
and actual directions of motion for less massive (left), intermediate (middle) and massive haloes (right) versus the ratio of the actual 
and initial, linear, speeds. The large scatter in directions at small and large fo/f20 are from haloes in dense regions. 



theory predicts primarily because they populate denser re- 
gions. Therefore, it may be sufficient to model the evolution 
of halo speeds by replacing the global value of the density 
parameter, Qo, in equation (Mj, with a local Sleff. Once this 
adjustment has been made, linear theory provides a good 
description of the distribution of halo speeds. In addition, 
linear theory provides a good description of the directions of 
halo motions. That is to say, once Qq -^ r^cff, an approach 
based on the Zeldovich approximation should be reasonably 
accurate at describing halo motions. The same is not true, 
of course, for the motions of dark matter particles; particles 
receive substantial nonlinear kicks as they become incorpo- 
rated into clusters. These kicks are approximately randomly 
distributed in direction. 



3.4 Comparison with simulations: Virial motions 

We have assumed throughout that the virial term depends 
on mass more strongly than it does on local density. For 
completeness, we think it useful to show that this is accurate. 
The symbols with error bars in the four panels in Fig. ng 
show (Tvir as a function of local density for ACDM haloes in 
four different mass ranges; least massive haloes are top left, 
and most massive haloes are bottom right. The mass cuts 
were chosen to be the same as in Fig. Q, so the symbols in 
this figure are the same as in that one. And as before, the 
error bars show the range in each bin within which 90% of 



the haloes are. The density was defined by smoothing the 
particle distribution on a grid with a Gaussian filter of width 
4Mpc//i. The dotted curves show the value of a^ir predicted 
by the virial theorem (our equation m for the associated 
mass range. The dotted curves were computed using the 
global value of the density parameter f2o, not the local one. 
They provide a reasonable fit to the simulations. For mas- 
sive haloes (bottom panels), CTvir appears to be higher in 
denser regions. This is almost entirely a consequence of the 
fact that the densest cells contain the more massive haloes 
in the mass bin. To illustrate this, we have computed the 
median halo mass in each density bin, using only the haloes 
in the requisite mass range for each panel, and used this 
mass to compute Uvir. These values are shown as the crosses 
in each panel — they have been offset downwards by a fac- 
tor of 0.25 (i.e., the plot shows the virial velocity divided 
by lO"'^^) for clarity. The crosses show the same trend as 
the main symbols, demonstrating that the virial term really 
does depend on mass, and not on local density. 



4 DISCUSSION 

We described a model in which galaxy velocities were writ- 
ten as the sum of two terms, one of which was effectively 
described using linear theory, and the other which was inher- 
ently nonlinear. We showed that this split was both useful 
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Figure 18. Dependence of virial motions on iialo mass and local 
density in the ACDM simulations. Symbols in the top left panel 
show the lowest mass haloes, symbols in the bottom right panel 
show the most massive haloes. Dotted curve shows equation (M) 
computed for the median halo mass in the panel, and using Qq, 
the global, not the local, value of the density. Crosses (offset down- 
wards for clarity) show the result of inserting the median mass in 
each bin into equation (pT) to compute a^i,.. 



and accurate for clustering which develops from Gaussian 
initial conditions, although our model should also work for 
more general initial conditions. For Gaussian initial condi- 
tions, a generic prediction of our model is a Gaussian core, 
with exponential wings for the distribution of galaxy pecu- 
liar velocities. The core mainly reflects the fact that most 
dark matter haloes move with the same speed, and this 
speed is less than the virial motions within clusters (both 
these are consequences of the Gaussian initial conditions); 
the wings arise primarily from the large nonlinear virial ve- 
locities within massive clusters. We compared our model 
with one dimensional peculiar velocity distribution functions 
f{v) dv extracted from dissipationless N-body simulations, 
where galaxies were formed and evolved with semi-analytic 
techniques. Our model, which explicitly considers whether 
dark matter particles, galaxies or galaxy clusters trace the 
velocity field, accounts for the shape of f{v) dv extremely 
well. 

We are not the first to have considered the shape of the 
galaxy distribution function statistic. Saslaw et al. (1990) 
present a formula for f{v) dv which is accurate for N-body 
simulations of clustering from Poisson initial conditions. The 
virtue of our approach is that it shows clearly how the distri- 
bution should depend on trace particle type. Raychaudhury 
& Saslaw (1996) present a measurement of f{v) dv using a 
sample of spirals compiled by Mathewson, Ford & Buchhorn 
(1992). It would be interesting to do the same with the more 
recent SFI and ENEAR data sets of spirals and ellipticals. 
Since our models depend explicitly on cosmology, it may be 
that these data sets are able to place interesting constraints 
on the density parameter. 

Sections argued that halo speeds evolve faster in denser 
regions. The fact that massive haloes populate denser re- 
gions has consequences for estimates of flo, studies of how 
and why the local value of the cosmic Mach number may dif- 
fer from the global one, and the detection of the kinematic 
Sunyaev-Zeldovich effect. 



Suppose one were to measure the peculiar velocities of 
galaxies. Since measuring distances is difficult, the veloc- 
ity of any one galaxy is often ill-determined. To increase 
signal to noise, one often 'groups' galaxies, before using 
their 'grouped' velocities. If these groups correspond approx- 
imately to dark matter haloes, then this is useful, because 
we showed that linear theory describes the directions of halo 
motions rather well. However, one often then weights all sub- 
sequent analyses by the signal-to-noise ratio. In effect, this 
means that the signal-to-noise weighted grouped velocities 
are dominated by the motions of massive haloes. Since the 
motions of massive haloes evolve more quickly than aver- 
age, they have a higher effective density parameter than 
the global value. It is interesting, therefore, that peculiar 
velocity based estimates of flo are often higher than those 
indicated by almost all other observations (e.g.. Smith et 
al. 1999; Branchini et al. 2000; Zaroubi et al. 2000; but see 
Borgani et al. 2000). 

The cosmic Mach number is the ratio of the large scale 
bulk velocity to the smaller scale velocity dispersion (Os- 
triker & Suto 1990). Recent work (van de Weygaert & Hoff- 
man 1999) suggests that local measurements of this Mach 
number may differ from the global one. Because our model 
is also phrased in terms of bulk flows and dispersions, it can 
be used to address this issue analytically. For example, it al- 
lows one to study how the distribution of the Mach number 
statistic depends on local density. 

An obvious consequence of our results is that large 
overdense regions, namely superclusters, are ideal places to 
look for the kinematic Sunyaev-Zeldovich effect: in these re- 
gions, low mass haloes, with relatively low optical depth, 
can have substantial peculiar velocities; they thus can yield 
detectable kinematic Sunyaev-Zeldovich fluctuations in the 
Cosmic Microwave Background radiation (Diaferio, Sunyaev 
and Nusser 2000). In this context, our model for the distri- 
bution of halo speeds, and its dependence on density, is then 
related to the distribution of temperature fluctuations one 
might measure. 

Our results suggest that whereas nonlinear effects can 
be thought of as adding Gaussian noise to the initial lin- 
ear theory velocity, this noise is higher for particles in mas- 
sive haloes. Therefore, the noise is spatially dependent — 
adding or subtracting a global thermal noise before compar- 
ing observations with theory may lead to inconsistencies. 
Should one choose to use a global thermal noise anyway 
(e.g. Freudling et al. 1999), then the 'temperature' associ- 
ated with ellipticals (e.g. the ENEAR catalog) , which occur 
more in clusters, will be larger than that for spirals (e.g., 
the SFI catalog) which occur more in the field. 

Our derivation of the single particle velocity distribu- 
tion function is the first step in a larger program. Baker, 
Davis & Lin (2000), following Davis, Miller & White (1997), 
consider a statistic which is essentially a smoothed version 
of our /(«) distribution. The operation of smoothing is es- 
sentially one of adding several, possibly correlated, veloci- 
ties. So, to compute the smoothed statistic from our model 
requires knowledge of how the halo velocity correlation func- 
tion depends on halo mass. We are in the process of extend- 
ing our model to include the effects of smoothing. 

Finally, our results suggest that, once account has been 
taken for the effect of the local density on the evolution of 
velocities, linear theory (or the Zeldovich approximation) 
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should provide a reasonably good estimate of the motions of 
haloes. This simplifies models of redshift-space distortions 
considerably (work in progress). Catelan et al. (1998) de- 
scribe a dark-matter halo based model of the density field 
in which motions are approximated using the Zeldovich ap- 
proximation. Our results suggest that if their approach were 
to be extended to describe redshift-space distortions, then 
it should be quite accurate. 
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APPENDIX A: THE DENSITY IN CELLS 
CONTAINING HALOES 

Massive haloes populate denser cells: this Appendix provides 
a simple estimate of how the mean density in cells which 
contain at least one halo of mass m increases as m increases. 
Let p{5) dS denote the probability that a cell of size V 
has overdensity in the range dS about 5, and let p{j\5) denote 
the probability that such a region in which the overdensity is 
S contains j haloes of mass m. The mean overdensity within 
cells of size V which contain at least one halo of mass m is 

., , , , , , \ Jd5SpiS)[l-piJ^0\6)] 

at least one halo ) — ^ , 

/ J dS piS) [1 - p{j = 0\5)] ' 

where the denominator is the fraction of cells that contain 
at least one halo of mass m. 

If p{j\5) were Poisson, then the probability that j = 
would be e-"(™l'')^. Sheth & Lemson (1999) show that this 
is a reasonable assumption on large scales. Since there are 
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many more low-mass haloes than massive ones, this expo- 
nential term tends to zero for small masses, and so the in- 
tegrals above reduce to (S) = 0. This says that the average 
overdensity in cells containing at least one less massive halo 
is zero. Massive haloes are slightly more complicated. To 
see how the integrals scale, it is useful to consider the large 
scale limit. In this limit, the linear bias model is reason- 
ably accurate: n{m\S) ~ n(m) [1 -I- h{m)S\, where b{m), the 
linear bias factor, increases as the mass m of the haloes of 
interest increases. To simplify things even further, suppose 
that n{m\S)V ^ 1 (although this assumption is inconsis- 
tent with the assumptions of linear bias and Poisson statis- 
tics, it helps illustrate our argument more clearly). Then 
1 " P{j = 0|5) ~ n{m)V [1 + 6(m)(5] and the integrals above 
become 

( 5|at least one halo ) = —. -^ = b(m) (5 >, 

\ / {n{m)V [1 + b{m)5]) ^ ' 

where we have used the fact that (5) = 0. Since h(ni) in- 
creases as m increases, this says that at fixed V, the mean 
density of cells in which there is at least one halo of mass 
m increases as m increases. This is in qualitative agreement 
with Fig. O which shows that the median density of cells 
which contain at least one halo of mass m increases as m 
increases. 
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